Universal-to-Specific Framework for Complex Action Recognition

نویسندگان

چکیده

Video-based action recognition has recently attracted much attention in the field of computer vision. To solve more complex tasks, it become necessary to distinguish different levels interclass variations. Inspired by a common flowchart based on human decision-making process that first narrows down probable classes and then applies "rethinking" for finer-level recognition, we propose an effective universal-to-specific (U2S) framework recognition. The U2S is composed three subnetworks: universal network, category-specific mask network. network learns feature representations. generates masks confusing through category regularization output further used guide class-specific entire optimized end-to-end manner. Experiments variety benchmark datasets, e.g., Something-Something, UCF101, HMDB51 demonstrate effectiveness framework; i.e., can focus discriminative spatiotemporal regions categories. We visualize relationship between classes, showing indeed improves discriminability learned features. Moreover, proposed model general may adopt any base

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Universal Representation for Unseen Action Recognition

Unseen Action Recognition (UAR) aims to recognise novel action categories without training examples. While previous methods focus on inner-dataset seen/unseen splits, this paper proposes a pipeline using a large-scale training source to achieve a Universal Representation (UR) that can generalise to a more realistic Cross-Dataset UAR (CDUAR) scenario. We first address UAR as a Generalised Multip...

متن کامل

GPU-Framework for Teamwork Action Recognition

Real time processing for teamwork action recognition is a challenge, due to complex computational models to achieve high system performance. Hence, this paper proposes a framework based on Graphical processing Units (GPUs) to achieve a significant speed–up in the performance of role based activity recognition of teamwork. The framework can be applied in various fields, especially athletic and m...

متن کامل

Action Recognition in the Rhino Cooperative Framework

The University of Rochester's Rhino system is a locally organized cooperative agent architecture that uses observation as its primary means of inter-agent coordination. Observing agents recognize ongoing individual actions, and from sequences of actions reason about likely ongoing plans. Sets of individual plans are combined to form hypothesized group plans. A key element in the group plan infe...

متن کامل

A Discriminative Framework for Action Recognition Using f-HOL Features

Inspired by the overwhelming success of Histogram of Oriented Gradients (HOG) features in many vision tasks, in this paper, we present an innovative compact feature descriptor called fuzzy Histogram of Oriented Lines (f-HOL) for action recognition, which is a distinct variant of the HOG feature descriptor. The intuitive idea of these features is based on the observation that the slide area of t...

متن کامل

An Unsupervised Framework for Action Recognition Using Actemes

In speech recognition, phonemes have demonstrated their efficacy to model the words of a language. While they are well defined for languages, their extension to human actions is not straightforward. In this paper, we study such an extension and propose an unsupervised framework to find phoneme-like units for actions, which we call actemes, using 3D data and without any prior assumptions. To thi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Multimedia

سال: 2021

ISSN: ['1520-9210', '1941-0077']

DOI: https://doi.org/10.1109/tmm.2020.3025665